-
Notifications
You must be signed in to change notification settings - Fork 236
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[NHWC] asm igemm xdlops fwd/bwd fp32/fp16 kernel #958
Conversation
Congratulations 🎉. DeepCode analyzed your code in 2.149 seconds and we found no issues. Enjoy a moment of no bugs ☀️. 👉 View analysis in DeepCode’s Dashboard | Configure the bot👉 The DeepCode service and API will be deprecated in August, 2021. Here is the information how to migrate. Thank you for using DeepCode 🙏 ❤️ !If you are using our plugins, you might be interested in their successors: Snyk's JetBrains plugin and Snyk's VS Code plugin. |
@carlushuang Could you please add urgency and value labels. Thanks. |
This comment has been minimized.
This comment has been minimized.
@atamazov Hi please help review this PR, now this PR have passed CI for once and framework team need this feature to do NHWC e2e test |
I have little idea of how to interpret/validate assembly kernels. Abstain from reviewing this PR, but thanks for your contribution. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM. I've only reviewed the host side code.
[2021-06-17T09:09:23.503Z] Backward weights convolution: ConvAsmBwdWrW1x1 |
CI all passed |
@junliume Thanks for the info from CI! Please note that correct quotation from the log is:
and faulty Solver is ConvOclDirectFwd1x1. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Blocked by #991
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
None objections!
@carlushuang ...but I still need an answer for this: #958 (comment) |
# RESOLVED Conflicts: # src/solver.cpp
CI run 30 has passed all the tests. |
* add NHWC fwd/bwd fp32/fp16 kernel * fix several bug in Herustic and Tuning * add missing kernel and update config list * remove useless config lists * fix fwd fp32 not valid config * fix isValid * fix bwd fp16 not proper IsApplicable * add ctest for nhwc asm kernels * reorg NextLinear * parse in opArgs as mutable, reserve place for in/wei/out pointer outside lambda * Assign OpKernelArg() to opArgs vector, set proper ctest flag
This is PR of asm igemm for NHWC layout. Currently this is for fwd/bwd, fp32/fp16.
ConvAsmImplicitGemmGTCDynamicFwdXdlopsNHWC
ConvAsmImplicitGemmGTCDynamicBwdXdlopsNHWC
tested these cases: perf_data
Most of the files in this PR is generated by iGEMMgen, with a single
.s
file for a single kernel. Apart from this, only several cpp files have changes. Below is list of non-asm files that have changes or added